ENUMERATING INVARIANT SUBSPACES OF R 



JOSHUA IDE AND LENNY JONES 

Abstract. In this article, we develop an algorithm to calculate the set of all 
integers m for which there exists a linear operator T on R n such that W 1 has 
exactly m T-invariant subspaces. A brief discussion is included as how these 
methods might be extended to vector spaces over arbitrary fields. 



1. Introduction 

The importance of invariant subspaces is undeniable in the study of linear opera- 
tors on vector spaces. One of the most infamous unsolved problems in mathematics, 
known as the Invariant Subspacc Problem, is the question of whether every sepa- 
rable Hilbert space V of dimension greater than 1 over C has a closed, nontrivial 
T-invariant subspace W (i.e. T(W) C W) when T is a bounded linear operator 
on V. For a very recent survey on this problem, see [2]. In this article, our main 
focus is on finite dimensional vector spaces V ~ R n over R. While these spaces 
are Hilbert spaces, we do not require the added structure of an inner product and 
the associated topology. In our situation, it is easy to see that there always exist 
T-invariant subspaces of V. For example, the kernel of T, the image of T, the trivial 
subspace and the space V itself are T-invariant. Moreover, any eigenspaces of T are 
also T-invariant. Aside from these obvious examples, there are other well-known 
techniques for generating T-invariant subspaces. For example, for any vector v G V, 
the cyclic subspace of V spanned by {v,T(v),T 2 (v), . . .} is T-invariant. Also, any 
generalized eigenspace for T is T-invariant. But even armed with these additional 
tools, it is not clear in general how to determine if, for given positive integers m and 
n, there exists a linear operator T on l n such that R™ has exactly m T-invariant 
subspaces. For example, could it be that there exists a linear operator T on R 4 
such that R 4 has exactly 10 T-invariant subspaces? Indeed, no such linear operator 
exists. In this article, we develop an algorithm to calculate the set of all integers 
m for which there exists a linear operator T on R" such that R ra has exactly m 
T-invariant subspaces. We end with a brief discussion as to how these ideas might 
be extended to vector spaces over arbitrary fields. 

2. Notation and Definitions 

In this section we present some notation and definitions from linear algebra and 
combinatorics that are associated to the techniques used in this paper. 
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2.1. Linear Algebra. Throughout this article we let F be a field, and V be an 
n-dimensional vector space over F, so that V ~ F n . We let T £ C(V), where C(V) 
is the space of all linear operators on V. If B is an ordered basis for V, we use 
the notation [T]g to denote the matrix for T relative to the ordered basis B, and 
simply write [T] for the matrix for T relative to the standard basis. We denote the 
image of T and the kernel of T as im(T) and ker(T), respectively. For a subspace 
W of V, we let T\w indicate the restriction of T to W. We let ct ■= ct(x) and 
tut '■= rriT{x) denote the characteristic polynomial and minimal polynomial of T, 
respectively. For the majority of this article, we are concerned with the case when 
F = M. Therefore, we let {e^ | i = 1,2, ... , n} denote the standard basis for R™, 
and we assume that all eigenvalues of T, if any, are real numbers. 

Definition 2.1 (Standard Jordan Block). Let A £ R, and let k > be an integer. 
A standard Jordan block of size k corresponding to A is the k x k matrix 



A 1 
A 1 













A 1 
A 



For any integer r > 0, we also define 



E 



(r) 



.- ker ((J A , fe - \I k ) r ) . 

a + bi with b > 0, and let C — 
. For an integer k > 0, a real Jordan block of size 2k corresponding to £ is 



Definition 2.2 (Real Jordan Block). Let £ 

a —b 
b a 
the 2k x 2k matrix 



T* 



C I 2 
C h 



Let 








C 

c 












C h 
c 








C 
c 



be a 2k x 2k matrix, and for any integer r > 0, define 

C^ r) : =ker((J c %-^) r ). 



Remark 2.3. Note that v = (vi,v 2 ,v 3 , 

v r+2 = ■■■ = Vk = 0. Similarly, v = (i>i, v%, V3, . . . , V2k) G 1 if and only if 

V2r+l = V 2r +2 = ■ ■ ■ = V2k = 0. 



,Vk) £ if and only if iv+i 

r(r) 
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2.2. Compositions, Partitions and Multipartitions. A composition [i of the 
integer n > 0, written /i 1= n, is a finite ordered set of nonnegative integers 
(a*1) A*2) • • • > Mr) such that appears at most once in the list and Xa=i M» = n - 
The integers /Xi are referred to as the parts of /x. We define the length of a composi- 
tion /x = (/Lti, . . . , /x r ), denoted ^(/x), to be r. A partition of n is a composition whose 
parts are all positive and weakly decreasing from left to right. If /x is a partition of n 
we write /ihn. A fi-multipartition ofn corresponding to fi — (/Xi, . . . , /x r ) N n is an 
ordered set of partitions 9 := 6(fi) = (9\, . . . , 9 r ) with #i h /Xj for each £ = 1, . . . , r. 
Note that from any /x-multipartition 9 of n we can derive a unique composition of 
n by removing the internal parentheses. We call this unique composition, which 
we denote d(/i, (9), the derived composition of the /x-multipartition 9. For example, 
if \x = (5, 6, 3) and 9 = ((4, 1), (3, 2, 1), (2, 1)), then d(fx, 9) = (4, 1, 3, 2, 1, 2, 1) of 
n = 14. 

3. The Results 

Proposition 3.1. Let V be a finite dimensional vector space over a field ¥, and 
let S,T € £{V)- If S and T are similar, then V has exactly the same number of 
S -invariant subspaces as T -invariant subspaces. 

Proof. Since 5* and T arc similar, there exists an invertible matrix P such that 
P~ 1 SP = T. First suppose W is an S- invariant subspace of V, and consider the 
subspace P _1 (W) of V. Then 

T(P-\W)) = P^SPiP-^W)) = P-\S(W)) C P~ 1 (W). 

On the other hand, if T(W) C W, then S(P(W)) C P(W). This establishes a 
one-to-one correspondence between the S'-invariant subspaces and the T-invariant 
subspaces, which completes the proof. □ 

The existence of a standard Jordan canonical form for a linear operator T 6 
£(F"), where F is an algebraically closed field, is well-known. What we require here 
is a generalization of this fact for linear operators on R™. In Section 01 we discuss 
a generalization over an arbitrary field. However, that particular generalization, 
when viewed over R, differs from the one we use here. For more information, the 
interested reader should consult pQ. 

Theorem 3.2 (Real Jordan Canonical Form). Let T £ zC(R") and suppose that 
Or = /i • • • f r , where either fj(x) = (x — Xj) lj or fj(x) = ( (x — aj) + b^ J with 
bj > 0. Then there is an ordered basis B of R" such that 



[T]b = 



Ji ••• 

J 2 ■■■ 

•■• Jt-i 

•■• J t 



where either Jj is a standard Jordan block corresponding to the eigenvalue Xj 6 R, 
or Jj is a real Jordan block corresponding to £j = aj + bji. In either case, we refer 
to Jj as a Jordan block of T . 

Remark 3.3. By Proposition 13.11 and Theorem 13. 2[ we may assume without loss 
of generality that [T] is in real Jordan canonical form. 
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Theorem 3.4. Let T 6 C(M. n ), and let a € C be such that cr{a) = 0, where either 
« = Agl or a = a + bi with b > 0. Suppose that [T] = ©j =1 </? is ifte rea/ Jordan 
form for T. If there exist two or more Jordan blocks for T corresponding to a, then 
W 1 has infinitely many T -invariant subspaces. 

Proof. Suppose first that a = A G ffi, so that A is an eigenvalue of T. Let E\ be 
the corresponding cigenspace of T. If T has exactly s > 1 standard Jordan blocks 
corresponding to A, then dim(E\) = s. Then, since s > 1, there are infinitely many 
1-dimensional subspaces of E\, and all such subspaces are T- invariant. Thus, the 
theorem is proved in this case. 

Now suppose that a = a + bi, with b > 0. Assume that T has exactly s > 1 real 
Jordan blocks J\, J2, . . . , J s , of respective sizes 2fci, 2&2, • ■ • , 2fc s , corresponding to 
a. Let U be the subspace of K" spanned by the ordered basis 

{ui, 142,1*3,^4} : = {ei, e 2 , e 2 fc 1+ i, e 2 fc 1+ 2}- 

Choose c, d € R with either c ^ or d ^ 0. Let 

»i = cmi + du3 and iy 2 = cu-i + du^. 

Then wi and W2 are linearly independent, and the space spanned by w\ and W2 is 
a 2-dimensional subspace W of [/. We claim that W is T-invariant. To see this, let 
w £ W and write 

w = r\Wi + r 2 w 2 = r\cui + r 2 cu 2 + ridu^, + r 2 du4, 

where r%,r 2 G R. It is then straightforward to show that 

T(w) = (ari — br 2 )w\ + (bri + ar 2 )w 2 £ W. 

Varying the choices of c and d yields infinitely many such distinct subspaces W of 
U and the proof is complete. □ 

Since it is our goal to enumerate the T-invariant subspaces of R ra , we assume, for 
every zero a of ct described in Theorem 13.41 that T has exactly one corresponding 
Jordan block. One ramification of this assumption is that ct — tut- In addition, 

if C T = Il*=i ft and M a ) = °» then the 

size of the Jordan block corresponding to 
a is ki or 2ki, depending on whether a is real or non-real, respectively. Since the 
Jordan blocks of T are independent [4], we focus on the T-invariant subspaces of 
a single block. Then we can piece together the various T-invariant subspaces from 
each block via direct sums to determine all T-invariant subspaces. 

Lemma 3.5. Let T e C(W l ), and assume that [T] = J, where J is a Jordan block. 

(1) If n = k and J = J\.k, then for each integer r > 0, E^ is T-invariant. 
Also, 

E^ < < . . . < if" 1 ) < E { k) = E[ k+1) =■■ 

with dim(E^) = r for all < r < k. 

(2) If n = 2k and J = J£ k , then for each integer r > 0, is T-invariant. 
Also, 

> <uW<-< < U {k) = U (k+1) =■■■ 

with dim([/i r) ) = 2r for all < r < k. 



ENUMERATING INVARIANT SUBSPACES 



5 



Proof. To prove [3, let v G E^ . Since J\ y k commutes with J\f. — XI k, we have 

(Jx.k - A4) r T(v) = T ( J A . fe - \I k ) r (v) = T(0) = 0, 
so that E^ is T-invariant. The other statements in part QJ follow from the fact 

J J q lb ± -lllVcLIlcLIlL SllltJt" U £ 



that {ei, e2, . . . , e r } is a basis for E^\ 

For[H, as in the proof of part [3, Jj[ r ^ is T-invariant since Jt k commutes with 



Ji k — Dk- The other statements in part[H follow from the fact that {ei, e2, . . . , e^r\ 
is a basis for . □ 

Theorem 3.6. Let T G C(M. n ), and assume that [T] = J, where J is a Jordan 
block. If n = k and J = J\.k, then the T-invariant subspaces of R™ are precisely 

(r) 

E\ i f or < r < k. If n = 2k and J = J£ kl then the T-invariant subspaces ofW 1 

(r) 

are precisely , for < r < k. 

Proof. Let W be a T-invariant subspace of R™. First suppose that n = k and 
J = J\,k- Then, for any w € W, we have that 

(T - XI k )w = T(w) - Xw e W. 

Thus W contains the set of vectors 

A w = {{T-XI k ) i w\i = 0,1,2,...}. 

Choose w = (oi, a2, ■ ■ ■ , a r , 0, . . . , 0) G W such that a r ^ and no other vector in 
W has a nonzero component in any location to the right of r. Then dim(VF) < r 
and 

Aw = {(oi, a 2 , . . . , a r , 0, . . . , 0), (a 2 , a 3 , . . . , a r , 0, . . . , 0), (a 3 , a 4 , . . . , a r , 0, . . . , 0), 
...,(a r ,0,...,0),(0,0,...,0)}. 

Since a r ^ 0, it follows that W contains the set {ei, e2, . . . , e r }. Hence, W = 
from part [3 of Lemma 13.51 

Now suppose that n — 2k and J = J? fc , where ^ = a + bi with 6 > 0. Let W 
be a T-invariant subspace of M. 2k . Let w = (101,102, . . . , w s , . . . , 0) G W such that 
w s ^ and no other vector in W has a nonzero component in any location to the 
right of s. Then s is even. For if w s ^ and s is odd, then the component of T(w) 
at location s + 1 is bw s , which is nonzero since b ^ 0, contradicting our choice of 

(r) 

w. So let s = 2r. Note that W C t/i ; . We proceed by induction on r to show that 
W = C/ C (r) . Observe that 

T(w) — aw 1 
v \ h W2r-lVJ 

w := — 5 = (*,*,...,*, 1, 0, . . . , 0) G W, and 

1 + 



w :=to — Mi 2r -iffi = (*,*,...,*, 0, 1, 0, . . . , 0) G W, 

where the 1 is at location 2r — 1 in u> and the 1 is at location 2r in w. The 
case r = 1 is then immediate. A key observation is that, for any r > 2 and 
w = (*,*,..., *, x, y, 0, 1, 0, ... , 0) where x is in location 2r — 3, the determinant of 
the 4x4 matrix whose rows are the four components of w, T(w), T 2 {w) and T 3 (w) 
in locations 2m — 3, 2m — 2, 2m — 1 and 2m, is 46 4 ^ 0. Thus, when r = 2, we 
have dim(iy) = 4, and hence = U) . Assume by induction, for all r < m— 1, 



(i 
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(r) 

that W = when the rightmost location of a nonzero component for any vector 
in W is 2r. Now suppose r = m, so that the rightmost location of a nonzero 
component of any vector in W is 2m. From our key observation, the four vectors 
w, T(w), T 2 {w) and T 3 (w), where w = (*,*,...,*, x, y, 0, 1, 0, 0) with x in 
location 2m — 3, are linearly independent. Thus, W contains an element of the 
form (*,*,...,*,*, z, 0, ... , 0), where z ^ is in location 2m — 2. Let 

W := {w G W | w = (wi,w 2 , ■ ■ ■ ,w 2m -2,0, ■ ■ ■ ,0)}. 

Clearly, Wq is a subspace of W, and since u>2 m -2 ^ for some w G Wo, we have 



by induction that W = £/ 



(m-l) 
C 



Then since 



(m- 



1} c W c c/^ r 







we see that {ei, e%, . . . , e2 m _2} Q W. Using this information and the fact that 
w, w G W, we get that {e 2m _i, e 2m } C W to conclude that W = U^ m \ which 
completes the proof of the theorem. □ 

In light of the previous results, we let T G £(R n ) with ct = tut = 0*=i /j 3 , 
where each /j is either a quadratic factor with non-real zeros, or a linear factor. 
We also let 



[T] 



Ji 









■h 









Jt-1 










Ji 



=®4 



be the real Jordan form of T, where each Jj corresponds to the factor /■ J in cy, 
and that the real Jordan blocks appear first on the diagonal, reading from upper 
left to lower right, followed by the standard Jordan blocks. Note that if Jj is a real 
Jordan block, then its size is 2k j, while if Jj is a standard Jordan block, its size is 
kj. Thus, if the last row in the last real Jordan block is 2r, then < r < [§J , and 
the last s = n — 2r rows of [T] are dedicated to the standard Jordan blocks. We 
are now in a position to prove the main result. 

Theorem 3.7. For a positive integer r, with < r < [5j , let s = n — 2r and let 
d r (/j,,9) be the derived composition of the fi-multipartition 8, where fi = (r,s) \= n 
and 6 = (^1,^2), with 61 h r and 82 h s. Then the exact set A4 n of positive 
integers m for which there exists T G £(M n ) such that W 1 has exactly m T-invariant 
subspaces is 



Mr 



(d r (fl,6)i 



1) I < r < 



Proof. The real Jordan blocks occupy the first 2r rows of [T]. There could be a 
single block of size 2r; or two blocks, consisting of one of size 2r — 2 and one of size 
2, or one of size 2r — 4 and one of size 4, and so on. Since the order of the blocks is 
irrelevant, these possibilities can be described simply by the partitions of r, where a 
block of size 2k corresponds to a part in the partition of r of size k. Each such block 
contains exactly k + 1 nested T-invariant subspaces by Theorem l3.6l The analysis 
of the standard Jordan blocks in [T] is similar, except there we use partitions of 
s = n— 2r. Since the blocks are independent, any T-invariant subspace will be a 
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direct sum of T-invariant subspaces originating from the individual blocks. Hence, 
for any two particular partitions 9\ h r and O2 h s, we can count the total number 
of T-invariant subspaces by considering the derived composition d r (fi,6), where p, 
is the composition (r,s) and 9 = (0i,02) is a multipartition. Then, each part of 
d r (/j,,9) corresponds to a single Jordan block in [T] and the total number of T- 
invariant subspaces for a fixed r is nf^T {d r (n, 9)i + 1). Letting r range from 
to l§ J exhausts all possibilities and completes the proof. □ 

We end this section with an example. 

Example 3.8. We examine the case of n = 4. We list the possibilities for d r (fi, 9) 
and the corresponding number N r of T -invariant subspaces for each value of r with 
< r < LfJ =2. Recall that s = n- 2r. 
• r = and s = 4. 
Thus fi = (0,4), and 

9 1 e {(0)}, 9 2 G {(4), (3, 1), (2, 2), (2, 1, 1), (1, 1, 1, 1)}. 



d (n,9) 


N 


(0,4) 


5 


(0,3,1) 


8 


(0,2,2) 


9 


(0,2,1,1) 


12 


(0,1,1,1,1) 


16 



Table 1 . Values of d (//, 9) and N 

• r = 1 and s = 2. 

Thus /i = (1, 2), and 

MP}, 2 e {(2), (1,1)}. 
Table 2. Values of di(fx,6) and iVi 



• r = 2 and s = 0. 

Thus fi = (2, 0), and 

9 X 6 {(2), (1,1)}, 9 2 e{(0)}. 
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(2,0) 3 

(1,1,0) 4 



Table 3. Values of d 2 {n, 0) and iV 2 



Combining the information from Tables[]l\^ and\3\ gives 
Mi = {3,4,5,6,8,9,12,16}. 

4. Extending The Results 

In this section we outline an approach, and indicate some of the difficulties 
therein, to extend the results of Section [3] to finite dimensional vector spaces V 
over an arbitrary field F. We point out that this problem has recently been solved 
in the case when F is finite [3]. We begin with a definition. 

Definition 4.1 (Generalized Jordan Block). Let F be a field, and let f(x) S F[x] 
be a monic polynomial of degree d that is irreducible over F. The kd x kd generalized 
Jordan block corresponding to f{x) is the kd x kd matrix 



}(x),k 



■C(f(x)) N 











C7(/(x)) N 



••• C(/(x)) N 

••• C{f{x))_ 

where C(/(x)) is the companion matrix of f(x), and TV is the d x d matrix with a 
1 in location (1, d) and zeros everywhere else. 

Remark 4.2. The idea of a generalized Jordan block can be extended further to 
allow for the possibility that /(x) is not irreducible over F, but such a generalization 
is not needed here pQ. 

Theorem 4.3 (General Jordan Canonical Form). Let V be a finite- dimensional 
vector space over a field F and let T S C(V). Then there is an ordered basis B of 
V such that 

3 = 1 

where each Jj is a generalized Jordan block, and the matrix [T]g is unique up to 
the order of the blocks. 

A careful analysis of the invariant subspaccs of the individual generalized Jordan 
blocks, combined with Theorem 14. 3[ should yield a theorem analogous to Theorem 
13.71 in this situation. One annoyance is that F might not have the property that a 
finite extension of F is algebraically closed, as is the case of R. In other words, there 
might exist irreducible polynomials over F of arbitrarily large degree. However, this 
concern could be handled on somewhat of a case-by-case basis. Since dimF(V) is 
finite, the possibilities for the sizes of the generalized Jordan blocks would be limited 
as well. That is, the largest degree of an irreducible factor of ct would be bounded 
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above by dim$(V). One would still have to consider the possibility of blocks of 
various sizes corresponding to irreducible polynomials whose degrees range from 1 
to dimf(V). 
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